Fruit Carts: A Domain and Corpus for Research in Dialogue Systems and Psycholinguistics
Authors
Abstract
We describe a novel domain, Fruit Carts, aimed at eliciting human language production for the twin purposes of (a) dialogue system research and development and (b) psycholinguistic research. Fruit Carts contains five tasks: choosing a cart, placing it on a map, painting the cart, rotating the cart, and filling the cart with fruit. Fruit Carts has been used for research in psycholinguistics and in dialogue systems. Based on these experiences, we discuss how well the Fruit Carts domain meets four desired features: unscripted, context-constrained, controllable difficulty, and separability into semi-independent subdialogues. We describe the domain in sufficient detail to allow others to replicate it; researchers interested in using the corpora themselves are encouraged to contact the authors directly.
Similar resources
Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
The paper describes experimental dialogue data collection activities, as well as semantically annotated corpus creation, undertaken within the EU-funded METALOGUE project. The project aims to develop a dialogue system with flexible dialogue management to enable the system's adaptive, reactive, interactive and proactive dialogue behaviour in setting goals, choosing appropriate strategies and monitoring numer...
Towards a psycholinguistics of dialogue: defining reaction time and error rate in a dialogue corpus
This study uses the multi-level coding of a designed corpus of unscripted task-oriented dialogues to demonstrate that time to respond (Inter-Move Interval, IMI) and rate of disfluency behave like psycholinguistic measures, reaction time and error rate, in reflecting the speakers’ cognitive burdens. Multiple-regression analyses show that IMI is sensitive to social distance between interlocutors,...
A Preliminary Investigation of Hierarchical Hidden Markov Models for Tutorial Planning
For tutorial dialogue systems, selecting an appropriate dialogue move to support learners can significantly influence cognitive and affective outcomes. The strategies implemented in tutorial dialogue systems have historically been based on handcrafted rules derived from observing human tutors, but a data-driven model of strategy selection may increase the effectiveness of tutorial dialogue syst...
Corpus based coreference resolution for Farsi text
"Coreference resolution", or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreferent when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
Coherence and Structure in Text and Discourse
Textual coherence versus discourse structure. Coherence is one of the most general and most widely discussed concepts in the study of text and discourse. In spite, or perhaps because, of its central status, the concept of coherence has many different and often incompatible definitions and connotations. For text linguistics or psycholinguistics, with their focus on the representation and processing of in...
Journal title:
- Computational linguistics
Volume 38, Issue 3
Pages: -
Publication date: 2012